- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources4
- Resource Type
-
0002000002000000
- More
- Availability
-
40
- Author / Contributor
- Filter by Author / Creator
-
-
White, Martha (4)
-
Chandak, Yash (2)
-
Bürkner, Paul-Christian (1)
-
Chandramouli, Suyog (1)
-
Dubova, Marina (1)
-
Gigerenzer, Gerd (1)
-
Grünwald, Peter (1)
-
Hanna, Josiah P (1)
-
Holmes, William (1)
-
Jordan, Scott (1)
-
Kumaraswamy, Raksha (1)
-
Le, Lei (1)
-
Liu, Vincent (1)
-
Lombrozo, Tania (1)
-
Marelli, Marco (1)
-
Musslick, Sebastian (1)
-
Nicenboim, Bruno (1)
-
Niekum, Scott (1)
-
Ross, Lauren_N (1)
-
Shiffrin, Richard (1)
-
- Filter by Editor
-
-
Ravikumar, Pradeep (1)
-
null (1)
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Ravikumar, Pradeep (Ed.)We consider the task of evaluating a policy for a Markov decision process (MDP). The standard unbiased technique for evaluating a policy is to deploy the policy and observe its performance. We show that the data collected from deploying a different policy, commonly called the behavior policy, can be used to produce unbiased estimates with lower mean squared error than this standard technique. We derive an analytic expression for a minimal variance behavior policy -- a behavior policy that minimizes the mean squared error of the resulting estimates. Because this expression depends on terms that are unknown in practice, we propose a novel policy evaluation sub-problem, behavior policy search: searching for a behavior policy that reduces mean squared error. We present two behavior policy search algorithms and empirically demonstrate their effectiveness in lowering the mean squared error of policy performance estimates.more » « less
-
Dubova, Marina; Chandramouli, Suyog; Gigerenzer, Gerd; Grünwald, Peter; Holmes, William; Lombrozo, Tania; Marelli, Marco; Musslick, Sebastian; Nicenboim, Bruno; Ross, Lauren_N; et al (, Proceedings of the National Academy of Sciences)The preference for simple explanations, known as the parsimony principle, has long guided the development of scientific theories, hypotheses, and models. Yet recent years have seen a number of successes in employing highly complex models for scientific inquiry (e.g., for 3D protein folding or climate forecasting). In this paper, we reexamine the parsimony principle in light of these scientific and technological advancements. We review recent developments, including the surprising benefits of modeling with more parameters than data, the increasing appreciation of the context-sensitivity of data and misspecification of scientific models, and the development of new modeling tools. By integrating these insights, we reassess the utility of parsimony as a proxy for desirable model traits, such as predictive accuracy, interpretability, effectiveness in guiding new research, and resource efficiency. We conclude that more complex models are sometimes essential for scientific progress, and discuss the ways in which parsimony and complexity can play complementary roles in scientific modeling practice.more » « less
-
Chandak, Yash; Jordan, Scott; Theocharous, Georgios; White, Martha; Thomas, Philip (, Advances in neural information processing systems)null (Ed.)
-
Liu, Vincent; Kumaraswamy, Raksha; Le, Lei; White, Martha (, AAAI Conference on Artificial Intelligence)
An official website of the United States government

Full Text Available